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ABSTRACT 

The 2'-deoxynucleoside containing the synthetic 
base 1 -[(2/?,4S,5R)-4-hydroxy-5-(hydroxymethyl)- 
tetrahydrofuran-2-yl)-1H-perimidin-2(3H)-one] 
(dPer) recognizes in DNA the 0®-benzyl-2'-deoxy- 
guanosine nucleoside (0^-Bn-dG), formed by 
exposure to /V-benzylmethylnitrosamine. Herein, we 
show how dPer distinguishes between 0^-Bn-dG 
and dG in DNA. The structure of the modified 
Dickerson-Drew dodecamer (DDD) in which 
guanine at position G"^ has been replaced by O^-Bn- 
dG and cytosine has been replaced with dPer to 
form the modified 0®-Bn-dG:dPer (DDD-XY) duplex 
[5'-d(G^G2C^)C*A^A^T^T^Y^G^°C^^G^2)-3']2 (X = O^-Bn- 
dG, Y = dPer) reveals that dPer intercalates into the 
duplex and adopts the syn conformation about the 
glycosyl bond. This provides a binding pocket that 
allows the benzyl group of 0^-Bn-dG to intercalate 
between Per and thymine of the 3'-neighbor A:T base 
pair. Nuclear magnetic resonance data suggest that 
a similar intercalative recognition mechanism 
applies in this sequence in solution. However, in 
solution, the benzyl ring of 0^-Bn-dG undergoes 
rotation on the nuclear magnetic resonance time 
scale. In contrast, the structure of the modified 
DDD in which cytosine at position 0^ is replaced 
with dPer to form the dG:dPer (DDD-GY) 
[5'-d(G^G^C^G'*A^A^T^T^Y?G^°C^ ^G^^)-3 ]2 duplex 
(Y = dPer) reveals that dPer adopts the anti conform- 
ation about the glycosyl bond and forms a less stable 
wobble pairing interaction with guanine. 



INTRODUCTION 

The alkylation of deoxyguanosine in DNA at the 
position, exemplified by exposure to A^-benzylmethylni- 
trosamine and the formation of 0''-benzyl-2'-deoxy- 
guanosine nucleoside ((?*-Bn-dG), is cytotoxic (1) and 
mutagenic (2,3)- The O -Bn-dG lesion is representative of 
bulky DNA adducts involved in the initiation of gene mu- 
tations (4,5). It predominantly causes G^A transitions 
(6,7) and is observed in human cells (8). The development 
of synthetic nucleotides as chemical probes enabling 
site-specific reporting of such DNA damage is of 
interest (9-22). The 2'-deoxynucleoside containing the syn- 
thetic base l-[(2i?,45,5i?)-4-liydroxy-5-(hydroxymethyl)- 
tetrahydrofuran-2-yl)- 1 //-perimidin-2(3//)-one] (dPer; 
Chart 1) recognizes the O -Bn-dG in DNA (23). Analyses 
of DNA containing 0^-Bn-dG or dPer paired opposite 
each other or natural bases have demonstrated that the 
0^-Bn-dG;dPer pair is more stable than any pairing of 
the damaged base opposite any natural base, or of dPer 
opposite a natural base (23). Developing an understanding 
of the structural basis for 0^-Bn-dG recognition is critical 
to further developing nucleosides such as dPer to recognize 
these mutagenic lesions. 

Herein, we explain the molecular basis by which dPer 
distinguishes between <9''-Bn-dG and dG. When deoxy- 
guanosine at position G"* of the Dickerson-Drew 
dodecamer (DDD) (24) was replaced by 0*-Bn-dG, and 
deoxycytosine was replaced with dPer to form 
the modified O'^-Bn-dGidPer (DDD-XY) duplex 

[5'-d(C'G^e'X^A^AVT**Y^G'*'c"G'^)-3']2 (X = C^-Bn- 
dG, Y = dPer) (Chart 1), dPer intercalated into the 
duplex and adopted the syn conformation about the 
glycosyl bond. This provides a binding pocket that allows 
the benzyl group of 0^-Bn-dG to intercalate between Per 
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O^-Bn-dG; OS-benzyl-2'- dPer; 1-[(2R,4S,5H)-4-hyclroxy- 

dpnxvauanosinp 5-(hydroxymethyl)tetrahydrofuran- 

deoxyguanosine g-yl)-! H-perimidin-2(3H)-one] 

B DDD-XY (NMR) 



-10^11^12 . 



3'-d(Gl2c"G"Y' T» t' a' X« )-5' 



DDD-XY (crystallography) 



5'-d(Cl G^ X* A^ a' t' T« y' g"c"g12)-3' 
3 ■ -d ( g2*c"g"y"t2°t"a"a"x"c"g"c" ) -5 ' 



DDD-GY 



y' g"c"g12) 
3'-d(G"c"G"Y' a' A^ G^ )-5' 

Chart 1. (A) Structures of O'-Bn-dG and dPer. (B) Sequences and 
numbering of the DDD-XY, DDD-XY and DDD-GY DDDs. The 
dodecamer exhibits pseudo-dyad symmetry, and in NMR spectra, 
both strands are numbered identically from nucleotides C' to G'^. 
The two strands are not symmetry related in the crystalline lattice 
and in crystallographic structures, the nucleotides are numbered from 
C' to G'^ in the first and from C'"' to G^'^ in the second strand. 

and thymine of the 3'-neighbor A:T base pair. In contrast, 
when deoxycytosine at position is replaced with dPer 
to form the duplex containing the dG:dPer base pair 

(DDD-GY) [5'-d(C'G2C^G'^A^A*T^T**Y''G"'C"G'^)-3']2 
(Y = dPer), dPer adopts the anti confonnation about the 
glycosyl bond and forms a less stable wobble pair with 
deoxyguanosine. 



MATERIALS AND METHODS 

Materials 

The unmodified 5'-dCGCGAATTCGCG-3' (DDD) was 
synthesized by the Midland Reagent Company 
(Midland, TX) and purified by anion-exchange high-per- 
formance hquid chromatography. The modified oH- 
godeoxynucleotides were synthesized using an ABl 394 
DNA synthesizer (Applied Biosystems, Foster City, CA) 
or a Mermade 9 DNA synthesizer (Bioautomation, Irving, 
TX) using P-cyanoethyl phosphoramidite chemistry. The 
dPer (23) and 0^-Bn-dG phosphoramidites were prepared 
as described previously (25). The yields of the stepwise 
coupUng reactions were monitored by trityl cation 
response. The ohgodeoxynucleotides were removed from 
the resin by treating with 18 M (saturated) ammonium 
hydroxide for 1.5 h at 25°C. After filtration, the result- 
ing solutions were heated at 55°C for 6h to deprotect 
the ohgodeoxynucleotides. All ohgodeoxynucleotides 
were purified by semi-preparative reverse-phase high-per- 
formance hquid chromatography (Phenoinenex, Phenyl- 
Hexyl, 5)im, 250 mm x 10.0 mm) equilibrated with 0.1 M 



triethylammonium acetate (pH 7.0). The ohgodeoxy- 
nucleotides were desalted by passing over Sephadex G- 
25 and characterized by matrix-assisted laser-desorption- 
ionization time-of-flight mass spectrometry. The concen- 
trations of single-stranded ohgodeoxynucleotides were 
estimated by ultraviolet (UV) absorbance at 260 nm 
on the basis of an extinction coefficient of 
1.11 X 10^M~'cm~', which was not adjusted for the 
presence of the modified bases (26). The ohgodeoxy- 
nucleotides were annealed by heating to 80° C for 15min 
and then cooled to room temperature. 

Thermal denaturation studies 

Melting temperatures were measured with a Varian Cary 
100 Bio spectrophotometer operated at 260 nm. The 
buffer used for measurements contained lOmM sodium 
phosphate, 50|.iM NajEDTA and 0.1 M NaCl (pH 7). 
The temperature was increased from 10 to 80° C at a 
rate of 0.5°C/min. Melting temperatures were calculated 
from first-order derivatives of the absorbance versus tem- 
perature profiles. The concentration of DNA was 1.5 |iM. 

Crystallizations and data collection of the 
DDD-XY duplex 

Crystallization trials were performed with the Nucleic 
Acid Mini screen (27) (Hampton Research, Ahso Viejo, 
CA). The hanging drop vapor diffusion technique was 
used. DNA was desalted and prepared in water at 
1.2mM concentration. Droplets with volume 2^1 of a 
1:1 mixture of sample and mini-screen buffer were 
equihbrated against 0.75 ml of 35% 2-methyl-2,4- 
pentaiiediol (MPD) at 18°C. Two crystals were obtained 
and found to be suitable for data collection. The first was 
crystalized from 10% MPD, 40itiM sodium cacodylate, 
12mM spermine tetra-HCl and 80 mM KCl, 20 mM 
BaCl2 (pH 7.0). The second was crystallized from 10% 
MPD, 40 mM sodium cacodylate, 12 mM spermine tetra- 
HCl, 40 mM LiCl and 80 mM SrCb (pH 7.0). Crystals 
were mounted in nylon loops and frozen in hquid 
nitrogen. Diffraction data were collected in a cold 
nitrogen stream on beamhne 21-ID-F at LS-CAT, APS 
(Argonne National Laboratory, Argonne, IL) for both 
crystals. Single-wavelength anomalous dispersion (SAD) 
data were collected on the 21-ID-D beamline for the 
first crystal at the energy corresponding to absorption 
peak for the Ba atom. All data were processed with the 
program HKL2000 (28) and XDS (29). 

Crystal structure determination and refinement of the 
DDD-XY duplex 

The PHENIX (30) software was used to calculate phases 
and initial placing of the model into the electron density 
map froin the SAD data for the first crystal, which was 
crystallized with BaCl2. Then, initial refinement of the 
model was performed with the Computer and Network 
Systems (CNS) (31) program (National Science 
Foundation), setting aside 5% randomly selected reflec- 
tions for calculating the Rfree- Rigid body refinement 
and simulated annealing were performed. After several 
cycles of refinement, the emergent model was used as the 
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starting model for phasing by molecular replacement 
methods for a data set obtained from the second crystal. 
Multiple rounds of coordinate refinements and simulated 
annealing led to an improved model for which sum (2Fo- 
Fc) and difference (Fo-Fc) Fourier electron density maps 
were generated. At a later stage solvent, water molecules 
were added on the basis of Fourier IF^-F^ sum and Fq-F^ 
difference electron density maps. Water molecules were 
accepted based on the standard distances and B-factor 
criteria. Further, structure refinement was performed 
using the program REFMAC in the Collaborative 
Computational Project Number 4 software suite (CCP4) 
(32). Geometry and topology files were generated for the 
0^-Bn-dG and dPer modified bases, and anisotropic tem- 
perature factor refinement was performed afterward. The 
programs TURBO-FRODO (33) and COOT (34) were 
used to display electron density maps. Helicoidal analysis 
was performed using the CURVES+ web server (35). 

Nuclear magnetic resonance 

The DDD-XY and DDD-GY modified duplexes were 
prepared at concentrations of 0.56 mM and 0.53 mM, 
respectively. The samples were prepared in lOmM 
NaH2P04, 0.1 M NaCl and 50 NaaEDTA (pH 7.0). 
To observe non-exchangeable protons, the samples were 
exchanged with D2O. The DDD-GY duplex was dissolved 
in D2O. The DDD-XY duplex was dissolved in 9:1 
D20:CD3CN. For the observation of exchangeable 
protons, the samples were dissolved in 9:1 H20:D20. 'H 
NMR spectra for DDD-XY duplex were recorded at 
900MHz at 10°C and 500 MHz at 7°C. 'H nuclear 
magnetic resonance (NMR) spectra for DDD-GY duplex 
were recorded at 800 MHz in D2O at 10°C and 600 MHz in 
9:1 H20:D20 at 5°C. Chemical shifts were referenced to 
water. Data were processed using TOPSPIN software 
(Bruker Biospin Inc., Billerica, MA). The nuclear 
Overhauser effect spectroscopy (NOESY) (36,37) and 
double quantum filtered correlation spectroscopy (DQF- 
COSY) (38) spectra in D2O were collected at 10°C; 
NOESY experiments were conducted at mixing times of 
150, 200 and 250 ms with a relaxation delay of 2.0 s. The 
NOESY spectra of the modified samples in H2O were col- 
lected with a 250 ms mixing time, with a relaxation delay of 
1.5 s. Water suppression was performed using the 
WATERGATE pulse sequence (39). 

NMR experimental restraints 

The NOESY spectra were processed using the TOPSPIN 
software (Bruker Biospin Inc., Billerica, MA), and the 
spectral data were evaluated using the program 
SPARKY (40) to obtain the cross-peak assignments. 
The intensities of cross-peaks were measured by volume 
integrations. Experimental intensities were combined with 
intensities obtained from complete relaxation matrix 
analysis (CORMA) of starting model to generate a 
hybrid intensity matrix (41,42). The intensities were con- 
verted to distances with the program MARDIGRAS, 
which refined the hybrid intensity matrix (43). 
Calculations were performed using 150, 200 and 250 ms 
mixing time data and 2, 3 and 4 ns isotropic correlation 



times. Evaluation of the resulting distance data allowed 
creation of upper and lower bound distance restraints that 
were used in restrained molecular dynamics (rMD) calcu- 
lations. Additional empirical base pair, backbone and 
deoxyribose pseudorotation restraints for base pairs not 
proximal to the sites of modification were obtained from 
canonical values derived from B-DNA (44). 

rMD calculations 

An unmodified B type DNA model was used as a starting 
structure. The cytosine at position C' in each strand was 
replaced by dPer with INSIGHT II (Accelrys Inc., San 
Diego, CA). Partial charges for Per were calculated with 
the B3LYP/6-31G* basis set in GAUSSIAN (45). The 
starting structure was energy minimized for 1000 cycles. 
A simulated annealing protocol (46) was used for the rMD 
calculations, which were conducted with the parni99 force 
field, using the program AMBER (47). Force constants of 
32 kcal mol~' A~^ were applied for distance restraints. The 
generalized Born model (48) was used for solvation. The 
salt concentration in all calculations was 0.1 M. Coupling 
of the molecule to the bath temperature was used to 
control the temperature during simulated annealing. 
First, calculations were performed for 20 ps (20000 
steps) by the foUowing protocol: During steps 0-1000, 
the system was heated from 0 to 600 K with a coupling 
of 0.5 ps. During steps 1001-2000, the system was kept at 
600 K. The system was then cooled from 600 to lOOK 
during steps 2001-18000 with a coupling of 4 ps. 
Further coohng from 100 to OK occurred during steps 
18 001-20 000 with a couphng of 1 ps. After initial cycles 
of refinement a longer 100 ps (100000 steps), calculation 
was performed by the following protocol: During steps 
0-5000, the system was heated from 0 to 600 K with a 
couphng of 0.5 ps. During steps 5001-10000, the system 
was kept at 600 K. The system was cooled from 600 to 
100 K during steps 10001-90000 with a coupling of 4 ps. 
Additional cooling from 100 to OK occurred during steps 
90001-100000 with a coupling of 1 ps. Structure coord- 
inates were saved after each cycle and were subjected to 
potential energy minimization. Nine refined structures 
calculated from the different starting structures were 
chosen based on the lowest deviations from the experi- 
mental distance and dihedral restraints and energy 
minimized to obtain an average structure. CORMA 
(41,42) was used to compare intensities calculated from 
these emergent structures with the distance restraints. 
Helicoidal analysis was performed using the CURVES+ 
web server (35). 



Data deposition 

The complete structure factor and final coordinates 
were deposited in the Protein Data Bank (www.rcsb. 
org): the PDB ID code for the DDD-XY duplex 
is 4HQI and for the DDD-GY duplex the PDB ID 
code is 2M11. Supplementary Table SI contains the CIF 
file. 
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RESULTS 

Thermodynamic studies 

The unfolding of the DDD, DDD-XY and DDD-GY 
duplexes was examined by temperature-dependent UV 
spectroscopy. The I'm values were deteimined by taking 
the first derivatives of the resulting UV melting curves. 
The melting temperature for the DDD was 45°C, for 
DDD-GY duplex was 28° C and for the DDD-XY 
duplex was 33°C. Notably, for the DDD-XY duplex, the 
presence of the dPer base complementary to 0^-Bn-dG 
increased the of by 5°C as compared with the DDD- 
GY duplex, in agreement with prior observations that 
dPer thermodynamically discerns the presence of 
O'^-Bn-dG (23). 

Structure of the DDD-XY duplex 

Two crystals suitable for data collection were obtained. 
The first was crystalized from the buffer containing 
20 mM BaCl2. This crystal diffracted to 1.95 A. The dif- 
fraction data were processed in space group Plililx 
(orthorhombic). The processing and refinement param- 
eters are shown in Supplementary Table S2. It was not 
possible to complete the phasing utilizing molecular 
replacement approaches. Instead, the experimental 
phases were obtained from SAD data collected at the 
energy corresponding to absorption peak for Ba. From 
these data, phasing was accomplished, as was the initial 
placement of the model into the electron density map. 
Then, initial refinement of the model was performed, 
setting aside 5% randomly selected reflections for 
calculating the Rtvee- Rigid body refinement and simulated 
anneahng produced a structure that was in good agree- 
ment with the experimental electron density. A second 
crystal, crystalhzed from the buffer containing 80 mM 
SrCl2, diffracted at the greater resolution of 1.7 A, also 
in space group P2\2i2x (orthorhombic). The data from 
the second crystal were phased using molecular replace- 
ment methods in which the structure from the first crystal 
was used as a starting model. Multiple rounds of coord- 
inate refinements and simulated annealing led to an 
improved structure for which sum (IFo-F^) and difference 
(Fo-F^.) Fourier electron density maps were generated. 

A total of 49 water molecules were added on the basis of 
Fourier IFo-F^ sum and Fo-F^ difference electron density 
maps. These were accepted based on the standard dis- 
tances and B-factor criteria. One Sr^^ ion was identified 
in the electron density map based on its low B-factor and 
the characteristic geometry, as well as one spermine. The 
cell parameters (a = 26.38, b = 36.77, c = 77.65, a = 90.0, 
P = 90.0, J = 90.0, Supplementary Table S3) were 
atypical for the DDD duplex. The volume of the unit 
cell was greater. The overah structure of the DDD-XY 
duplex is shown with waters, Sr^"^, and a spermine 
molecule in Supplementary Figure SI. Although no 
electron density was observed for the 5'-terminal bases 
C' and C'^^, and thus their positions could not be 
determined with certainty, the 3'-terminal bases G'^ and 
G^"* rotated out of the duplex toward the major groove of 




Figure 1. The DDD-XY duplex in the region of the C':G^^ X'':Y^' 
and A^:T^° base pairs, showing the electron density map. The dPer 
nucleotide recognizes the benzyl group of 0''-Bn-dG (X'') via a 
stacking interaction such that the benzyl ring intercalates between the 
T^° and Y^' bases. 



adjacent molecules. The crystal data collection and refine- 
ment statistics are compiled in Supplementary Table S3. 

Figure 1 shows the DDD-XY duplex in the region of 
the C^:G^^ X^:Y^^ and A^:T^^ base pairs. Both the 0^-Bn- 
G and Per bases fit well into the electron density map. 
0*-Bn-dG remained in the anti conformation about the 
glycosyl bond. In contrast, the dPer nucleoside adopted 
the syn conformation. The intercalation of the Per base 
created a binding pocket into which the benzyl ring of the 
O'^-Bn-dG base was inserted. The benzyl ring of the 
0*-Bn-G base also formed a stacking interaction with 
T^" of the 5'-neighbor A^iT^" base pair. The simultaneous 
insertion of both dPer and the benzyl ring of the C>''-Bn- 
dG base increased the helical rise between neighboring 
base pairs C^'iG-^ and A^:T^° to 9.5 A, as compared with 
the anticipated rise of ~6.8 A in B-DNA (Supplementary 
Figure S2). It also unwound the duplex. For the modified 
duplex, the twist at base pairs C^:G^' and X'^iY^' was 
— 15°, whereas for the unmodified duplex it was 25°, a 
change of 40° (Supplementary Figure S2). For the dPer 
phosphodiester backbone angles a and y changes of ~210° 
were observed compared with the unmodified duplex 
(Supplementary Figures S3 and S4). The / angle for 
dPer was in the range of 60-80°, which was consistent 
with the .syn conformation (Supplementary Figure S5). 
The intercalation of the Per base, which was located 
between 0''-Bn-G and the 5' neighbor cytosine in both 
strands, increased the ^ angles of C^^ and C'^ by 90° 
(Supplementary Figure S5). Watson-Crick base pairing 
at the neighbor base pairs C^:G^^ and A^:T'° was not 
disturbed. Figure 2 illustrates the stacking between the 
benzyl ring of O^-Bn-dG and dPer. 

NMR spectroscopy of the DDD-XY duplex 

To ascertain whether the crystallographic structure for the 
DDD-XY duplex was representative of the solution struc- 
ture, a series NMR spectroscopy experiments were 
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duplex. (Left panel) Stacking of the C^^:G^^ base pair (black) above 
X" (blue) and Y^' (green). (Right panel) Stacking of the X^-.Y^' pair 
(in blue and green, respectively) above base pair A^iT^" (black). The 
benzyl ring of 0*'-Bn-dG (X"*) is stabilized by intercalation between T^" 
and Y"^' (dPer) bases. 



conducted. These proved to be challenging, as the 
resonances associated with the (?''-Bn-dG:dPer pairing 
interaction exhibited severe line broadening. The source 
of the hne broadening was not identified but may be 
attributed to rotation of the benzyl ring of (9*-Bn-dG at 
a rate intermediate on the NMR time scale. However, it 
was observed that the addition of 10% CD3CN to the 
solvent as an organic modifier resulted in significant line 
narrowing to the resonances associated with the O^-Bn- 
dG:dPer interaction. The addition of the organic modifier 
did not otherwise affect the NMR spectrum of the duplex, 
and the duplex structure of the DNA was maintained 
('vide infra'). The 0*-Bn-dG benzyl protons were 
observed as three signals between 6.6 and 7.4 ppm 
(Figure 3). All gave cross-peaks with the X"^ methylene 
protons Hmi, Hm2 (1 c, 2c, 4c, 5c, 6c, Figure 3c). This 
indicated that in the presence of the organic modifier, 
rotation of the benzyl ring was rapid on the NMR time 
scale. The resonance located farthest downfield at 7.3 ppm 
was assigned as X"* H^^i^, whereas the resonance located 
farthest upfield at 6.7 ppm was assigned as X"^ Hortho- The 
X"^ Hpara protou was assigned at 7 ppm. Cross-peaks 
were observed between Hortho^ Hmeta (Id, Figure 3d), 

Hmeta^Hpai-a (2d, FigUrC 3d), Hortlio^Hp^a (3d, 

Figure 3d). Interstrand cross-peaks between the X"* 
benzyl ring and T^ CH3, H2', H2" were observed 
(la, 2a, lb, 2b, 3b, 4b, Figure 3a and b). A weak cross- 
peak between H9 and X"^ Honho was observed 
(7d, Figure 3d). The dPer (Y"^) resonances were observed 
upfield from the benzyl ring protons of 0''-Bn-dG (X"*). 
Cross-peaks between dPer hydrogens are shown in 
Figure 3d (4d-6d, 8d-14d). Additional cross-peaks 
between the dPer base and its deoxyribose were identified 
(5b-8b, 8c-llc, Figure 3b and c). One weak interstrand 
cross-peak was identified between C"* H2' and Y' H6 
(7b, Figure 3b). 

In the sequential NOE connectivity between base 
aromatic and deoxyribose HI' protons (49,50), a number 
of the anticipated NOEs were weak (Supplementary 
Figure S6). These included the C^ H6^C^ HI', the C^ 
Hl'^X"^ H8, the X"^ HS^X"^ HI' and the X"^ Hl'^A^ 
H8. Also, the T^ Hl'^T** H6, T*^ H6^T** HI', T** 
Hl'^Y' H8, Y' H8^Y' HI' and Y"* HI'^G'" H8 
NOEs were weak. The magnitude of the Y' H^'^Y'' HI' 
NOE was consistent with the syn conformation of the 
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Figure 3. The NOESY spectrum for the DDD-XY duplex, showing the 
assignments of the O^-Bn-dG and dPer protons. The cross-peaks in 
(a) are assigned as follows: la, T** CHj^X'' H„eu,; 2a T* CHj^X" 
Horiho- The cross-peaks in (b) are assigned as follows: lb, T** H2'—>-X'* 
H,„eia; 2b, T** nr^X" H„rti,; 3b, T** m'^X" Hortho; 4b, T** H2"^X'' 
Hortho; 5b, Y" H27H2'WY'' H8; 6b, Y' H27H2"^Y'' H9; 7b, 
H2'^Y'' H6; 8b, Y' H27H2"^Y' HI'. The cross-peaks in (c) are 
assigned as follows: Ic, Hmi^X'' Hn,eta; 2c, X'^ Hm^^X"* Hmem', 
3c, Y** H4'^X'' H^eu,; 4c, X" Hmz^X" Hp„„; 5c, X"* Hm,^X'' Hortho; 
6c, X" Hm2-^X* Hortho; 7c, Y** H4'^X'' Hortho; 8c, Y** H4' ^Y' H8; 
9c, Y' HS'^Y' H9; 10c, Y' H4'^Y'' HI'; 11c, Y' HS'^Y' HI'. The 
cross-peaks in (d) are assigned as follows: Id, X"* Horiho^ H„eia; 2d, 
X" Hp„„^X^ Hn,eu,; 3d, X" Hortho^ X" Hp„„; 4d, Y'' HI'^Y" H8; 5d, 
Y' H6^Y'' H8; 6d, Y' HV^Y'* H8; 7d, Y' H9^X'* Hortho; 8d, Y' 
Hg^Y' H8; 9d, Y' H4^Y'' H5; lOd, Y** He^Y** H5; Ud, Y'* 
H7^Y' H5; 12d, Y' HI'^Y' H9; 13d, Y' H7^Y^ H9. The 
spectrum was collected at 10°C at 900 MHz, using a 250 ms mixing 
time. 



dPer nucleotide about the glycosyl bond. In the imino 
and amino proton regions of the spectrum, the Y' imino 
proton could not be identified (Supplementary Figure S7). 
This was attributed to rapid exchange with solvent. Thus, 
in the sequential connectivity of the base imino protons 
(51), no T*^ N3H^Y'' imino or Y' imino^G'" NIH NOE 
was observed. The A^ H2^T** N3H NOE was weak as 
compared with the A* H2^T^ N3H NOE. 



Structure of the DDD-GY duplex 

To determine the basis by which dPer differentially 
recognized the (9*-Bn-dG adduct versus dG, the structure 
of dPer placed opposite dG (DDD-GY) was determined. 
This duplex was not amenable to crystallographic 
analysis. However, it was possible to complete a structural 
determination by NMR. Assignments between aromatic 
protons of the base to deoxyribose HI' protons are 
shown in Figure 4. At the G'^iY' base pair, the C^ 
H6^C^^ HI', C^ Hl'^G'' H8, G"^ H8^G'^ HI', G"^ 
Hl'^A^ H8 and A^ H8^A^ HI' cross-peaks were 
observed and were of normal intensities. Also, the T^ 
H6^T^ HI', T** Hl'^Y"^ H9, Y' H9^Y'^ HI', Y"* 
H1'^G'° H8 and G'" H8^G"' HI' NOEs were 
observed and were of normal intensities. There was a 
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Figure 4. The NOESY spectrum of the DDD-GY duplex, showing se- 
quential NOEs between the aromatic and anomeric protons from C' to 
G'~. The spectrum was collected at 10°C at 800 MHz, using a 250 ms 
mixing time. 



small chemical shift difference compared with unmodified 
duplex for the T** and bases. 

The sequential NOEs between the base imino protons 
showed a strong cross-peak between the G NIH 
imino proton and the dPer imino proton 

(u, Figure 5b). The sequential connectivity of the base 
imino protons was thus obtained from base pairs 



>C3:G'°- 



-G^iY"- 



►A*:T^ (Figure 5b). 



The region of the spectrum showing NOEs between the 
base imino and amino protons and adenine H2 protons 
showed cross-peaks for base pairs A^:T^, A'':T^, G^:C", 
G"':C'', and it showed that G and Y' formed a base pair 
(k, 1, m, n, Figure 5a). The G'" cross-peak had a similar 
chemical shift as compared with G^. The G"^ NIH and Y' 
HN resonances were shifted upheld to 10.2 and 10.7 ppm. 

In the NMR spectrum for the DDD-GY duplex 
(Figure 6), two additional resonances were observed at 
10.3 and 10.7 ppm, which were assigned to the G"* and 
Y^ bases. These were broad as compared with the other 
immo resonances. The chemical shift for the G'° NIH 
imino proton was similar to that of the G^ NIH imino 
proton; these two resonances remained sharp even at 
higher temperatures. The thymine T^ N3H imino reson- 
ance remained sharp at higher temperatures as compared 
with the T** N3H imino resonance in the DDD-XY duplex 
(Supplementary Figure S8). 

The assignment of the dPer aromatic protons H4, H5, 
H6, H7, H8 and H9 is shown in Figure 7. These were 
observed between 6.2 and 7.4 ppm. The cross-peak 
between T** HI'^Y'' H9 was identified (10 d. Figure 7d). 
Based on the intensities of cross-peaks, H8 and H7 were 
identified. They both showed cross-peaks to H9 (Id, 3d, 
Figure 7d). The H6 cross-peak was identified based on its 
proximity to H8 and H7 and H5 (2d, 4d, 6d, Figure 7d). 
The H5 proton showed a cross-peak to H4 (7d, Figure 7d). 
This peak was broad and shifted upheld to 6.2 ppm. H9 
and H8 showed cross-peaks to T** H2' (la, 4a, Figure 7a). 
H6 and H5 showed cross-peaks to the T^ CH3 group (2a, 
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Figure 5. The NOESY spectrum for the DDD-GY duplex, (a) 
Interstrand NOEs between complementary bases. The cross-peaks are 
assigned as a, HI^T** N3H; b. A* H2^T8 N3H; c. A* HI'^T' 
N3H; d. A* A^H2^T' N3H; e, A^ H2^T' N3H; f. A*" H2^T^ N3H; 
g, C' W-Hl^G'" NIH; h, C' 7V^H2^G"' NIH; i, C" N'm^G^ 
NIH; j, C" yV^H2^G^ NIH; k, G* Hl'^Y' HN; 1, Y"* HI'^Y' 
HN; m, G" Hl'^G'* NIH; n, Y' Hl'^G'* NIH. (b) NOE connectivity 
for the imino protons for the base pairs G-:C", C^iG'", G'';Y'', A^;T , 
A':T'. The cross-peaks are assigned as T** N3H^T' N3H, T" 
N3H^Y'' HN (p), T** NSH^G'' NIH (o), Y' HN^G'" NIH (s), 
G^ NIH^G'" NIH (r), G^ NIH^G'" NIH and Y' HN^G'* NIH 
(u). Cross-peak (t) could not be assigned. The same cross-peak was 
observed for the DDD-XY duplex (peak k. Supplementary Figure 
S7). A NOESY experiment at a shorter mixing time of 70 ms showed 
no change in intensity and suggested that this may be an exchange 
cross-peak of unknown origin. The experiment was carried out at 
5°C and with a mixing time of 250 ms at 600 MHz. 



3a, Figure 7a). Additional cross-peaks between H9, H8, 
H7 and its deoxyribose and to T*^ deoxyribose protons 
were assigned (Figure 7c). 

The structure of the dG:dPer (DDD-GY) duplex was 
detennined using a simulated anneahng rMD protocol, 
restrained by experimental distance restraints determined 
from NOEs. Supplementary Table S4 shows the restraints 
used for rMD calculations. Nine structures were energy 
minimized and superimposed to obtain the average struc- 
ture (Supplementary Figure S9). Supplementary Figure 
SIO shows these superimposed structures and the 
average structure. The latter was in good agreement with 
the experimental restraints confirmed by CORMA (52) 
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Figure 6. The ID NMR spectra showing the imino proton resonances 
for the DDD-GY duplex as a function of temperature. The individual 
nucleotides are identified as superscripts. Spectra were collected at 
600 MHz. 




Figure 8. The average structure of the DDD-GY duplex, in the region 
of the C^:G'° G^-.Y'^ and A^iT** base pairs. Base Y** is shown in green. 
The dPer ring is oriented in the major groove. It does not disrupt the 
neighbor base pairs. Hydrogens are omitted for clarity. 




Figure 9. The average structure of the G :Y base pair, in the DDD- 
GY duplex. G"* forms a wobble pair with the complementary dPer (Y^) 
base. The anticipated hydrogen bonds are indicated as gray dashed 
lines. 
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analysis. Supplementary Table S5 shows the structural 
statistics. Figure 8 shows the DDD-GY duplex in the 



G'^-.Y'^ and A^:T** base pairs. The 



D1 (ppm) 

Figure 7. The NOESY spectruin showing dPer resonance assignments 
for the DDD-GY duplex. The cross-peaks in (a) are assigned as 
follows: la, T* HI'^Y' H8; 2a, T'* IVle^Y'' H6; 3a, T** Me^Y' 
H5; 4a, T** H2'^Y' H9. The cross-peaks in (b) are assigned as 
follows: lb, T** H2"^Y'' H8; 2b Y** H2"^Y'' H8; 3b, T** H2"^Y'' 
H7; 4b, Y** H2"^Y' H7; 5b, Y' H2'^Y'' H9; 6b, T** H2"^Y'' H9; 7b, 
Y' H2"^Y'' H9. The cross-peaks in (c) are assigned as fohows: Ic, T" 
H3'^Y' H8; 2c, Y** H3'^Y'' H8; 3c, Y'' H5"^Y'' H9; 4c, T** 
H5"^Y' H9; 5c, Y' H5'^Y'' H9; 6c, Y'' H4'^Y'' H9; 7c, T** 
H3'^Y'' H9; 8c, Y'* H3'^Y'' H9. The cross-peaks in (d) are 
assigned as follows: Id, Y' H9^Y'' H8; 3d, Y' H9^Y'' H7; 2d, Y"* 
He^Y' H8; 4d, Y' H6^Y'' H7; 6d, Y' H5^Y'' H6; 5d, Y' H4^Y'' 
H6; 7d, Y' H4^Y'' H5; lOd, T** H6^Y' H9; 9d, Y' HI'^Y' H9; 8d, 
T** Hl'^Y** H9. The spectrum was collected at 10°C, with 250ms 
mixing time, at 800 MHz. 



region of the C^:G'° 

dPer Y' base formed a wobble base pair with the comple- 
mentary guanine G"*, involving two hydrogen bonds 
(Figure 9), which was supported by a strong cross-peak 
between imino protons of G"^ and Y' of opposite strands 
(cross-peak u, Figure 5b). The dPer ring was oriented in 
the major groove and adopted the and conformation 
about the glycosyl bond. It did not disrupt neighbor 
base pairs. The dPer base stacked with its 5' neighbor 
T*^, but it did not stack well with its 3' neighbor G'" 
(Figure 10). The complementary guanine, G"* stacked 
well with its 3' neighbor A^, but not with C^. Helicoidal 
analysis (Supplementary Figures Sll, SI 2, SI 3 and SI 4) 
revealed that the ^ angle of the dPer nucleotide increased 
by ~50° compared with the unmodified duplex, which 
corroborated the reduced stacking between dPer (Y^) 
and the 3' neighbor guanine (G'°) (Supplementary 
Figure S14). 



DISCUSSION 

The dPer synthetic nucleoside (Chart 1) recognizes (9*-Bn- 
dG, as indicated by thermodynamic stabilization of the 
(9^-Bn-dG:dPer interaction (23). The present studies, in 
the DDD sequence context, reveal a 5°C increase in the 
Tyi of the DDD-XY as compared with the DDD-GY 
duplex, which corroborates the previous results. It was 
originally hypothesized that the abihty of dPer to recog- 
nize O^-Bn-dG was due to stacking and hydrophobic 
interactions with the benzyl ring of the DNA adduct, 
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Figure 10. Stacking interactions for tlie DDD-GY duplex, (a) Stacking 
of the C^iG'" base pair (black) above the G'':Y' base pair (green), 
(b) Stacking of G" and Y** (black and green, respectively) above the 
A^iT** base pair (black). The dPer ring is in the major groove. The dPer 
(Y') base stacks with T**. 

combined with potential hydrogen bonding between the 
anti conformation of dPer about the glycosyl bond and 
the Nl and A^^ nitrogen atoms of the alkylated 
deoxyguanosine (23). The present data provide experimen- 
tal evidence for a different mode of recognition. 

dPer recognizes O^-Bn-dG via a stacking interaction 

The data suggest that in this DNA sequence the presence 
of dPer 'traps' the benzyl ring of 0*-Bn-dG between the 
dPer nucleotide and T^", providing a mechanism whereby 
dPer recognizes the 0^-Bn-dG DNA damage (Fi gures 1 
and 2). Crystallographic electron density maps show the 
insertion of the dPer base into the DNA provides a 
binding pocket for the benzyl group of 0^-Bn-dG to inter- 
calate between Per and thymine of the 3'-neighbor A:T 
base pair. The simultaneous insertion of Per and the 
benzyl group of 0^-Bn-dG unwinds the duplex at the rec- 
ognition site (Figure 1 and Supplementary Figure SI), as 
suggested by the weak sequential NOE connectivity cross- 
peak observed between C"* HI' and X"* H8. Additionally, 
the weak cross-peak T** Hl'^Y^ H8 is consistent with an 
increased distance between these bases. The chemical 
shifts of the dPer base resonances, observed in the 
5.5-6.4 ppm range (Figure 3), are consistent with the in- 
sertion of dPer into the duplex and ti-ti stacking with the 
benzyl group of 0^-Bn-dG. Furthermore, the absence of 
the dPer imino resonance in NMR spectra indicates that 
the imino proton is in enhanced exchange with the solv- 
ent, consistent with a lack of base pairing (Supplementary 
Figure S7). 

The orientation of the dPer base about the glycosyl 
bond in this DNA sequence was of interest. The syn- 
glycosyl conformation of the dPer nucleoside was 
observed when it was not incorporated into DNA (23). 
On the other hand, Gong and Sturla (23) had suggested 
hydrogen bonding between the N 1 and N^ nitrogen atoms 
of the alkylated deoxyguanosine and dPer, requiring the 
anti conformation of dPer about the glycosyl bond. In the 
crystallographic data obtained at a resolution of 1.7 A, 
when inserted into the electron density map in the syn 
conformation, the resulting crystallographic R factor 
was minimized. If dPer was instead inserted into the 
electron density map in the anti conformation, it did not 
fit well, and residual difference (Fo-F^) Fourier electron 
density was observed. The NMR data also show a weak 
NOE between the dPer base H9 proton and the deoxyri- 
bose HI' proton, consistent with dPer adopting the syn 



glycosyl torsion angle (Supplementary Figure S6). 
Overall, we conclude that in this DNA sequence, the in- 
sertion of dPer into the duplex is stabilized by a combin- 
ation of base stacking and steric factors. Gong and Sturla 
(23) have observed similar thermodynamic profiles in 
other sequences, which suggests that the mechanisms of 
recognition could be similar. On the other hand, the pos- 
sibility that the combination of stacking and steric factors 
observed in the present structures could be modulated by 
DNA sequence must be considered. For example, one 
might predict a more stable stacking interaction involving 
(9''-Bn-dG if the present 3'-neighbor base A:T base pair 
were to be exchanged for a 3'-neighbor C:G base pair 
(Figures 1 and 2). It would thus be of interest to 
complete a structural analysis(es) of the recognition of 
0^-Bn-dG by dPer in other sequences. 

It seems that the simultaneous insertion of the Per base 
and the 0^-Bn-dG lesion explains the greater volume of 
the crystallographic unit cell (Supplementary Table S2) as 
compared with the canonical DDD, and that changes in 
the crystal packing of the C>^-Bn-dG:dPer duplex explain 
why attempts to phase crystallographic data by the mo- 
lecular replacement method failed. The electron density 
for the two 5'-terminal nucleotides C' and C'^ is not 
visible, suggesting that these bases are disordered in the 
crystal. The terminal bases may be unable to fit into the 
lattice owing to the intercalated structure of the modified 
duplex. 

The observation that dPer recognizes 0*-Bn-dG via a 
stacking interaction rather than via hydrogen bonding 
interactions is consistent with the notion that base 
stacking interactions are of importance in stabilizing 
nucleic acid duplexes and contribute to the sequence de- 
pendence of DNA duplex stability in unmodified DNA 
(53-57). Inter-strand stacking interactions have been 
found to underlie the stabihty of some chemically 
modified DNA duplexes. For example, Gallego and 
Loakes (58) reported on the solution structure and 
dynamics of oligodeoxynucleotide duplexes containing 
the universal base analogs 5-nitroindole and 
5-nitroindole-3-carboxaniide, concluding that these base 
analogs exist as a mixture of two different stacking con- 
figurations. Matsuda et al. (59) reported that for the 
2'-deoxynucleotide containing the propynyhsocarbostyril 
base analog (dPICS), the large aromatic rings of 
propynyhsocarbostyril (dPICS) pair in an intercalative 
manner within an ohgodeoxynucleotide duplex. 
Likewise, Malyshev et al. (60) determined the structure 
of an oligodeoxynucleotide duplex containing the unnat- 
ural dMM02-d5SlCS pair and concluded that this unnat- 
ural base pair adopted a weU-defined structure, with only 
small helical distortions. Their structure revealed that the 
unnatural dMM02-d5SlCS paired via partial interstrand 
intercalation. The intercalation of nucleoside analogs may 
influence behavior in polymerase-mediated DNA synthe- 
sis reactions (59,60). Biphenyl groups placed as a pair in 
a DNA duplex intercalate side by side as a pair between 
the natural base pairs and also undergo dynamic motion 
(61). As well, ohgonucleotides composed of achiral 
non-nucleosidic building blocks, such as pyrene and 
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phenanthrene, embedded in DNA lead to duplex stabil- 
ization on the basis of inter-strand stacking interactions 
(62-64). 

The NMR analysis leads to the conclusion that the 
intercalative recognition mechanism for the O^'-Bn- 
dG:dPer pair applies in solution and provides information 
regarding solution dynamics of the interaction. The obser- 
vation that the benzyl protons of the (9''-Bn-dG appear as 
three resonances (Figure 3) is consistent with rotation of 
the benzyl ring in solution on the timescale of the NMR 
experiment. It is possible that the ring flipping is 
associated with DNA breathing motions, i.e. occurs 
when the DNA duplex is transiently open. This dynamic 
behavior probably accounts for the line broadening at 
base pairs C^G'° and X'^:Y'' in the NMR spectrum 
(Figures 3 and Supplementary Figure S6). The flipping 
benzyl ring between Per and T*^ is consistent with line 
broadening observed both for T** and dPer protons. 
Similar flipping of the styrenyl moiety has been observed 
in the NMR spectrum for the 5(61,2)-i?-(A^-adenyl)styr- 
ene oxide adduct, when placed in DNA (65). DNA con- 
taining the (9*-Bn-G:C pairing is destabilized relative to an 
unmodified G:C base pair (23); however, attempts to char- 
acterize the structure of the 0^-Bn-G lesion in DNA were 
unsuccessful, as the NMR spectra showed spectral 
broadening, which suggested that the lesion induced con- 
formational disorder into the duplex. However, a struc- 
tural analysis of an 0^-Bn-dG modified template: primer 
complexed with the Y-family polymerase Dpo4 revealed 
that (?*-Bn-dG formed a wobble base pair when placed 
opposite dC and pseudo Watson-Crick hydrogen bonding 
when placed opposite dT (66). 

dPer pairs with guanine via a wobble base pairing 
interaction 

The present results reveal formation of a wobble pair 
between dPer and dG, with dPer oriented in the anti 
conformation with respect to the glycosyl bond, with 
hydrogen bonds involving dPer and the Nl and A^" 
nitrogen atoms of the guanine (Figure 9). The presence 
of these hydrogen bonds is consistent with 
the NMR data, which shows that the sequential connect- 
ivity of the base imino protons from base pairs 
C^:G'°^G^Y'*^A^:T** is observed (Figure 5b). 
Moreover, the region of the spectrum showing NOEs 
between the base imino and amino protons (Figure 5) is 
consistent with the notion that G"^ and Y' form a 
wobble-like base pair, as there was no break in the NOE 
connectivity between bases, and the T^^Y' and y'^G'" 
cross-peaks were weak. Notably, the chemical shifts for 
the dPer protons are observed 6.6-7.4 ppm, i.e. further 
downfield than for the DDD-XY duplex, suggesting 
reduced stacking interactions (Figure 7). These downfield 
shifts are consistent with the positioning of the dPer ring 
into the major groove, as seen in Figures 8 and 10. The 
observation that the Tm for the DDD-GY duplex is 5°C 
lower than that for the DDD-XY duplex suggests that the 
stability imparted by this wobble interaction is lower than 
that from the dPer:(9*-Bn-dG intercalative interaction, 
perhaps also due to poorer stacking interactions between 



dPer and the flanking bases (Figure 10), thus providing a 
basis for specificity. The presence of the wobble-pair inter- 
action, however, perhaps limits the selectivity of dPer for 
O'^-Bn-dG over dG. 



SUMMARY 

The synthetic nucleoside dPer distinguishes between 
0^-Bn-dG and dG in this DNA sequence by an 
intercalative binding mode. It enables the benzyl group 
of <9''-Bn-dG to intercalate between dPer and thymine of 
the 3'-neighbor A:T base pair. The binding of the benzyl 
group is captured in the face-to-face stack in the crystal 
structure but is dynamic on the NMR timescale. In 
contrast, dPer forms a less stable pair with dG, which is 
characterized by a wobble-type H-bonding interaction. 
The structural insight gained in this study provides infor- 
mation that may be apphed to chemical modifications that 
could further stabilize dPer:0^-Bn-dG stacking inter- 
actions and/or destabilize the dPer:0^-Bn-dG wobble 
interaction. 
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